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Abstract — Motivated by fading channels and erasure channels, 
the problem of reliable communication over deterministic relay 
networks is studied, in which relay nodes receive a function of 
the incoming signals and a random network state. An achievable 
rate is characterized for the case in which destination nodes have 
full knowledge of the state information. If the relay nodes receive 
a linear function of the incoming signals and the state in a finite 
field, then the achievable rate is shown to be optimal, meeting 
the cut-set upper bound on the capacity. This result generalizes 
on a unified framework the work of Avestimehr, Diggavi, and 
Tse on the deterministic networks with state dependency, the 
work of Dana, Gowaikar, Palanki, Hassibi, and Effros on linear 
erasure networks with interference, and the work of Smith and 
Vishwanath on linear erasure networks with broadcast. 

I. Introduction 

In their celebrated paper [lj that opened the field of network 
coding, Ahlswede et al. found the multicast capacity of 
wireline networks. For wireless networks, however, there are 
some new challenges for reliable communication compared 
to the wireline network. Among them are broadcast and 
interference, and there has been some work that deals with 
these two features. In [2], the multicast capacity was shown for 
networks that have deterministic channels with broadcast, but 
without interference at the receivers. Deterministic networks 
were further studied in [3 ] to incorporate interference at the 
receiving nodes, where the capacity for linear finite field 
networks was found. These rather simple models were shown 
to give good insights in solving real- world network problems. 
For example in [4j, Avestimehr et al. were able to approxi- 
mately characterize the capacity of Gaussian relay networks 
within some constant gap using a similar approach used for 
deterministic networks. Although previous models consider 
broadcast and interference, they did not explicitly consider 
another important feature in wireless communications. The 
wireless medium in real- world communications suffer fading, 
which in turn cause severe degradation of the transmitted 
signal. Although the deterministic model can be a good 
abstraction in understanding broadcast and interference, it does 
not fully capture the effect of fading in wireless networks. In 
this sense, the erasure network in which transmitted symbols 
get erased at random provides a simple model that captures 
the fading characteristics. In Q, Dana et al. considered the 
erasure networks with broadcast and no interference, where the 



erasures are at the traversing edges. Smith and Vishwanath (6| 
considered an erasure network without broadcast, where the 
interference is modeled as a linear finite field sum of incoming 
signals that are not erased. In both and (6), if the destination 
node has perfect knowledge of the state information, they 
showed that the capacity is given by the cut-set bound. 

In this paper, we consider a deterministic network in which 
the observation at each node is a function of the incoming 
signals and a random state. The channel state affecting the 
relay and destination nodes is assumed to be perfectly known 
at the destinations. We give an achievable rate for this class 
of networks, and show that the associated coding scheme 
achieves the capacity for the case in which the relay and 
destination nodes receive a linear function of the incoming 
signals and the state over a finite field. This result generalizes 
the work of Dana et al. and the work of Smith and Vishwanath 
on linear erasure networks to handle both interference and 
broadcast. As for deterministic networks, our result generalizes 
the work of Avestimehr, Diggavi, and Tse on the deterministic 
networks to deterministic state-dependent networks. 

II. Problem Statement and Preliminaries 

In the following we will give useful definitions for later use. 
Upper case letters denote random variables (e.g., X, Y, S) and 
lower case letters represent scalars (e.g., x, y, s). Calligraphic 
letters (e.g., .4) denote sets and the cardinality of the set 
is denoted by \A\. Subscripts are used to specify node and 
time indicies. For example, X u and X u ^ denotes the signal 
sent at node u and the signal sent at node u at time i, 
respectively. To represent a sequence of random variables we 
use the notation X™ = X Vj i, . . . , X Vy7l . We will frequently 
use random variables subscripted by sets to denote the set 
of random variables indexed with elements in the set. For 
example, X A = {X a :ae A} and X\ = {X™ :aeA}. 

We consider a network Q — (V, £ ) where V and £ are the 
set of nodes and directed edges, respectively. Without loss of 
generality, we let V = {1, . . . , |V|} and index the source node 
with 1. We use V and K = V - ({1} UP) to denote the set of 
destination nodes and relay nodes respectively. The network 
has one channel input X u G X u associated with each node 
u G V, where X u is the alphabet of X u . This incorporates the 



broadcast nature of the network. Each node v G V observes 

Y v = f v {X Mv ,S), (1) 

where the input neighbors AT V of v is defined as J\f v = 
{u : (u,v) G £}. The random variable S is a random 
state affecting nodes, which is independent of the source 
message. The state sequence is memoryless and stationary with 
p(s n ) = nr=i^( 5 *)- We assume that each destination d G V 
has side information of the state sequence. The source node 
wishes to send a common message m G [2 nR ] = {1, . . . , 2 nR } 
to all destination nodes. 

A (2 nR , n) code consists of a source encoding function 
4>i, relay encoding functions (j) Vji , v G V — ({1} U V), 
i G {1, . . . , n}, and decoding functions ipd, d EV, where 

01 : [2 nR ] - 

: X -1 A^, i E {1, . . . , n}, v e 7£, 

where M is uniformly distributed over [2 nR ] . The probability 
of error is defined by 

p(n) = p r {^ d (yn ? ^n) ^ M for SQme d e £>} 

A rate is said to be achievable if there exist a sequence of 
(2 nR ,n) codes with P e (n) as n — > oo. 

For each d G V, a cut % C V is a subset of nodes such 
that 1 G Ud and d e We will omit the destination index 
when it is clear from the context. We define a boundary of 
a cut as d(U) = {u : (u, e £,u e U,v e U c } and the 
boundary of a complement of a cut as B(U C ) — {v : (u, v) G 
£,ueU,v eU c }. 

We say that a node v is in /oyer I if all directed paths 
from the source to v has I hops. Let L be the longest distance 
from the source node to any node. We say that a network is 
layered with L layers if every node in V belong to some layer 
I G {0, . . . , L}. The set of nodes in layer I is denoted by V/. 
Without loss of generality we will assume that Vo = {1}. 

For a random variable X ~ p(x), the set tJ 77 ^ of e-typical 
n- sequences x n is defined Q as 

T e (n) 4 {x n : |7r(a|x n ) -p(a)| <£-p(a),Va G AT} 

where 7r(a|x n ) is the relative frequency of the symbol a in 
the sequence x n . 

III. Main result 

A. General state dependent networks 

Given a class of relay networks as defined in (Q}, the 
multicast capacity C is upper bounded by 

C < max mm mm I X^c S). (2) 

P (x v ) dev u d d ' d 

The upper bound is from the cut- set bound (U Theorem 
15.10.1] by treating the state information as additional outputs 
to the destinations, and using the fact that the state sequences 
are independent of the message, the memoryless property of 
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Fig. 1. Example of an erasure network. Su,v are erasures events for links 
(u, v) G S. 



the channel, and the deterministic nature of the channel given 
S. 

Remark 1: The cut-set bound is given by ([2]) whether we 
assume that the relay nodes have state information or not, 
as long as the state information at the relays are causal (i.e., 
%v,i = fiv^iyl -1 , s 1 )) and destination nodes have the state 
information. 

As our main result we state the following theorem. 

Theorem 1: For the multicast relay network Q = (V, £) in 
(Q}, if all destination nodes in V have side information of the 
state, then the capacity C of the network is lower bounded by 

C> max min min H(Yu c \Xu c ,S). (3) 

YlievP( x i) deV Ud d 

The proof of this theorem will be given in Sections [TV] and [Vj 

Remark 2: Theorem Q] includes the special case of unicast 
networks if \V\ = 1. 

Example 1 (fi5\ Theorem 1 ]): Consider a network with out- 
put symbols Y v = {Y UjV : u G Af v }, where Y U:V is 
the observation at node v through the edge (u,v). Thus, 
the receiving nodes receives a separate output for each link 
connected to the node, i.e., has no interference. Let the output 
random variables take values from 3^ = X U {e}, where the 
symbol e is the erasure symbol. Each channel output Y UiV is 
given by the transmitted signal X u with probability 1 — e UyV 
or an erasure symbol e with probability e UyV . Let S UyV ^ be a 
random variable indicating erasure occurrence across channel 
(u, v) G £ at time i. If an erasure occurs on link (u, v) G £ 
at time i, the value of S UiVj i will be one, otherwise zero. Let 
S n = {S™ v : u G J\f v }. If the destination nodes have the 
S n sequence as side information, this channel falls into the 
channel model described in Section [III since the output at each 
relay is a function of the incoming signals and S n . It can 
be shown that the cut- set bound is achieved by the uniform 
product distribution. Hence, the capacity of this channel is 
given by © with equality. 

B. Linear finite field fading networks 

Consider a finite field (GF(q)) network in which each node 
v G V observes 

Y v = ^ ] S U ^ V X U (4) 



where Y v , X u ,u G Af v , S u ^ Vj u G Af v , are in GF(q). If we 
assume that S u , v ,V(u,v) G £ is known at the destination 
nodes, this channel falls into the class of channels in Section 

m 

Let Y^c and X^ be vectors of observations in d(U c ) and 
input signals in d(U), respectively. These are of observations 
and input signals of nodes that have an edge passing through 
the cut. We define a transfer matrix of an arbitrary cut U as 
Gu such that it satisfies Y^ c = G^X^. 

Thus, the random matrix Gu consists of zeros when there 
is no connection between the nodes and S u , v if u G d(U) and 
v G d(U c ). The column index represents the sending node 
index in d(U) and row index represents the receiving node 
index in d(U c ). For the example in Figure [T] we have the 
expression 
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for the cut U = {1,2}. 

Theorem 2: The multicast capacity of the linear finite field 
fading network © is 

C = minmin£'[rank(Gw d )] logg. 

dev lid 

Proof: Proof is omitted due to space limitations. ■ 
Remark 3: For the special case of S G {0, 1}, Theorem 
[2] includes the capacity result for linear finite field erasure 
networks with broadcast and interference. 

IV. Proof of Theorem [T]for Layered networks 

We begin by showing the achievabililty of Theorem [T] for 
layered networks with V = {d}. The multicast network is a 
simple extension of the single destination network and will be 
treated later. 

We use a block Markov encoding scheme in which we 
divide the message m into K parts m&, k G {1, . . . , K}. We 
code in K + L — 1 blocks of length n. Message rrik takes 
values from [2 nR ] for all k and the overall rate is given by 
(K+L-i) w hich approaches R as K — > oo. 

We will use two types of indexing for the inputs, outputs, 
and state. We will use s n (j) to denote the state sequence when 
message rrij is being sent at the source node. For the set of 
observations and input sequences at layer I carrying message 
rrij, we will use the notation 

Vv, K) = K), s n (j + l)):veV l } (5) 

and 

x^rrij) ± {x^y^rrij)) : v EVi}, (6) 

respectively. For example, (0) denotes the set of observation 
sequences of the nodes in layer I when rrij is received. Due to 
the layered structure of the network and the coding strategy, 
which will be explained in the following, the observation 
sequences corresponding to the jth message at layer I are 



functions of s n (j + I). This will be explained in more detail 
in the following. 

Codebook generation: Fix p(x u ) for all u G V — {d}. 
Randomly and independently generate 2 nR sequences x™{m), 
m G [2 nR ], each according to YYi^iP( x ^,i)- For each u G V — 
{1}, randomly and independently generate x™(y™) sequences 
for each y™ G y™, according to ]Xi=iPi. x u,i)' 

Encoding: To send message rrij, j G {1, . . . ,K} 9 the 
encoder sends x^rrij), while at each layer I, node v G Vi 
sends x™{y™(mj-i)). 

Decoding: When the destination receives y2(rrij), it also has 
{s n (l), . . . , s n (j + L)} from previous observations. Assuming 
the previous blocks were decoded with arbitrarily small error, 
the receiver declares that a message was sent if it is a unique 
index rrij G [2 nR ] such that 

1=0 

otherwise an error is declared. 

From the encoding we can see that there is a I block delay 
at layer I, I = {1, . . . ,L}. When the source sends message 
rrij, the relays in layer 1 send x^^rrij-i), the relays in layer 
2 send Xy 2 (mj_2) and so on. Accordingly, when the source 
sends the jth block, received observation sequence of node 
v G Vi is a function of ■ (rrij-i-i) and s n (j), which gives 
©. Table U shows the coding strategy for a simple diamond 
network given in Figure [2] 

The decoding is a typicality check over an intersection of 
disjoint sets. Recall that from © and ©, as message rrij 
traverses through the network, the message is being affected 
by a different state at each layer. Therefore, we require that 
all inputs and outputs of that layer and a state (corresponding 
to the specific block time) are uniquely jointly typical. 

Before dealing with arbitrarily large networks, we will 
first give a proof for a simple diamond network. Consider 
a diamond network depicted in Fig. [2] at the top of the next 
page. The relay nodes {a, b} in layer 1 receives Y a , Y^ which 
are deterministic functions of X\ and S. The destination node 
in layer 2 observes Yd, which is a deterministic function of 
X a , Xb, and S. Without loss of generality, we will assume 
that rrij = 1 was sent, and show the decoding and probability 
of error analysis for the jth block. We will omit the message 
index for simplicity. There are two types of error events: 

£ = (A\ H A\) c and E 1 = [j (A™ n A™) 

where 

Af 4 {(X?(m),Y a n (m),Y b n (m),S n (j)) e lf>} , 

and 

AT = {(X2(m),XZ(m),Y d n (l),S n (j + 1)) G Tj">} . 

For the first error event, we have P{Eq) — > as n — > oo by 
the law of large numbers. We will decompose E\ into four 



TABLE I 

Coding strategy of the deterministic diamond network with state in Fig. [2] 



Block index 


Layer 


Layer 1 observes 


Layer 1 transmits 


Layer 2 observes 


State 


j 












i/dK-i' sn 0')) 




j+1 


x™(m j+1 ) 










y2(mj,s n (j + l)) 


s n (j + l) 


j+2 




yZ(m J+2 ,s n (j + 2)) 


yZ(m J+2 ,s n (j + 2)) 


x a( m 3 + l) 




yS(m i+ i,^0" + 2)) 


S -(j + 2) 



disjoint events. Let 

#Q = {^(m) ^ YS(1), YSc(m) = 1^(1)} 

where Q C {a, 6} and Q c = {a, 6} — Q. We have four such 
events since {a, b} has four subsets. Then the probability of 
Ei is given by 

p(e 1 )=p\ U(^rn^) 



f a (Xi,S) : X a 



< P{A?C\A%} 



(7) 



ra/1 



= X1 E p Kn^n^} (8) 

where in © we have used the union bound and ([5]) is from 
the fact that Bq are partitions that cover the whole set. Thus, 
we have decomposed Ei into four disjoint events. The event 

A rn n B 7n^ implies 



'} 



(9) 



{(xr(m), F a "(m), F b "(l), 5"(j)) G TW 
and A™ D implies 

{(X a "(m),X b "(l),17(l),5"(i + 1)) G lf>} (10) 

since Xg(m) = X£(Y^(m)). Since © and (ITQb are indepen- 
dent events, we have 

P{ifn4^n5 { m a} } 

< 2-n(I(X 1 ,Y a ;Y b \S)-3e)2-n(I(X a ;Y d \X b ,S)-3e) 
= 2-n(H(y b ,ld|5,X 6 )-6e) 

where in the last step we have used the Markov structure of 
the network. Similar to the previous steps, we can bound the 
other events by 

P{A™ n A™ n Bf b} } < 2 -^( y a,Yd|5,x a )-6e) 5 (12) 

P{A™ n A™ n < 2- n W y «^i^ x «^)- 6c ), (13) 

and 

p{A™ nA^n £^ 6} } < 2-< H ^ s ^ e \ (14) 

Combining ©, (Qj), CGk C3, and d, we get P(E'i) -> 
as n — > oo if 

f #(y a ,n,y d |s,x a ,x 6 )-6e, 

ff(y d |5)-3e, 

if(y 6 ,y d |5,x 6 )-66, 
#(y a ,y d |s,x a )-6e 



R < min < 



(15) 



Xi(m) 




id = fd{X a ,X b ,S) 



AT = 



Y b = f b (X u S):X b 

Fig. 2. Deterministic diamond network with state. 

which concludes the proof for the diamond network. 

We now move on to the proof of Theorem [T] for general 
layered networks. We will show the proof for decoding mes- 
sage rrij , and define similar events as in the diamond network 
to lead us through the proof. As before, we omit the message 
index for simplicity. Let 

{ (X£ (m), Y% +1 (m), S n (j + I)) G } . (16) 

Notice that we are abusing notation for the destination obser- 
vation in (fT6l) . For A™_ 1 , Yft (m) should be Fj\ which is 
the given observation at the destination and is not tested for 
typicality. 

Assuming m = 1 was sent, we have two sources of error: 
E ±( L f)At) and 2*4 (J 

The error event P(Eq) — > as n — > oo. As we did in the 
diamond network case we will decompose the error event Ei 
with each Bq, Q^IZ. The probability of Ei is given by 

p{e 1 )=p \ un^r 

[ra^l 1=0 



ra/1 QCU I 1=0 



(17) 



where the inequality is due to the union bound and the last 
step is due to partitioning the events. The event A™ D Bq 
implies 

{(x^(m),X^(l),YS l+1 (m),YS f+1 (l),S n (j + l)) 



where Qi = Vi D Q and Qf = Vi~ Q\. Then, 

(L-l ^ L-l 

p \ n a ™ n b q \ ^ n 2 



. Z=0 



/=0 
L-l 



duplicate paths of the original network. Using Lemma 6.2 in 
El (by including a state random variable in the conditional 
entropies) we have the relation 

(T + N -1) mm H(Y Uc \X U c 1 S)<H(Y ac \X ac ,S) (20) 



n -n{H{Y Q c jX Q c,5)-3e) 
1 /+l • where AT = 2l y l" 2 . We also have for any distribution, 



i(H(Y Qf+i \X Qf ,S)-3e) 



1=0 

From dT71) and (TT81) we get 

= ^ ^ 2 -^Ezt"o 1 (^(^Qf +1 I^Qf^)- 3e ) 
m /l QC1Z 

< 2 nR 2~ n ^ J ° liHiYQ ?+i lXQ f> S) - 3e) 

= ^ 2 nR 2~ n ( H{yuc \ Xuc ' S) ~ e,) 
Qcn 

where e' = 3Le and ZY C = {Q c ,d} which gives a cut in the 
network. Thus, P(E\) — > as n — > oo if 

i?<mini^(F W c|X W c,5)-e / , 

which proves Theorem [T] for layered networks with a single 
destination. 

Remark 4: Consider a semi-deterministic layered network 
Q = (V, £ ) where each node v G V — {d} observes Y v = 
fv(X/sf v ,Y d ) and the final destination gets Y d ~ p(yd\xAT d ), 
i.e., a stochastic output. Using the coding scheme above we 
can show that all rates R that satisfies 

R < max mm I (Xu;Yuc\Xuc) 

are achievable for unicast. 

For the multicast scenario we declare an error if any of 
the nodes in V makes an error. Using the union bound and 
the same line of proof as in Section [IV] for each d G V, we 
can show that the probability of error is arbitrarily small for 
sufficiently large n if 

R< max mm mm H (Yn c \Xu c , S). 

UievPixi)^ U * d d 

V. Arbitrary networks 

For extending the layered network result to arbitrary net- 
works we use the same line of proof as done in [3) that unfolds 
Q into a time-extended network. We will just give an outline 
of the proof. For more details on unfolding Q, we refer to 
|3] due to space limitations. Given an arbitrary network Q, 
we unfold the original network over T stages to get a layered 
network Q. Using the coding scheme for the unfolded layered 
network, we can achieve 

i?<i max mmH{Y Qc \X Qc ,S) (19) 

1 rUevPOEi) U 



mm H(Y Uc \X Uc , S) < T mm H(Y U c \X U . , S) , 
u u 



(21) 



since the right hand side corresponds to taking the minimum 
over only steady cuts (subset of all possible cuts). Combining 
(|20t with (EB we have 

lim )- max mm H (Y Uc \X Uc , S) 

< max mm H(Yuc\Xuc : S). 

Finally, with the relations (TT91) and (BTt . we can show that rates 
arbitrary close to the right hand side of © are achievable for 
sufficiently large T. 
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where U is a cut in the unfolded network. We normalize 
the right hand side by T since the network gives at most T 



